Bo An

Towards Efficient Online Tuning of VLM Agents via Counterfactual Soft Reinforcement Learning

May 01, 2025
Via arXiv

MF-LLM: Simulating Collective Decision Dynamics via a Mean-Field Large Language Model Framework

Apr 30, 2025
Via arXiv

Guiding VLM Agents with Process Rewards at Inference Time for GUI Navigation

Apr 22, 2025
Via arXiv

A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment

Apr 22, 2025
Via arXiv

Generative Auto-Bidding with Value-Guided Explorations

Apr 20, 2025
Via arXiv

LLM$\times$MapReduce-V2: Entropy-Driven Convolutional Test-Time Scaling for Generating Long-Form Articles from Extremely Long Resources

Apr 08, 2025
Via arXiv

From Understanding to Excelling: Template-Free Algorithm Design through Structural-Functional Co-Evolution

Mar 13, 2025
Via arXiv

Policy Regularization on Globally Accessible States in Cross-Dynamics Reinforcement Learning

Mar 10, 2025
Via arXiv

LongSpec: Long-Context Speculative Decoding with Efficient Drafting and Verification

Feb 24, 2025
Via arXiv

Solving Urban Network Security Games: Learning Platform, Benchmark, and Challenge for AI Research

Jan 29, 2025
Via arXiv